CDS

Accession Number TCMCG011C15371
gbkey CDS
Protein Id XP_021900261.1
Location join(214899..215180,218204..218341,218607..218688,218907..218983,219196..219302,220395..220574,220652..220748,220854..220945,221180..221246,221358..221436,221531..221622,221770..221913,221999..222028)
Gene LOC110816402
GeneID 110816402
Organism Carica papaya
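
The Location field above lists the exon segments of the CDS as 1-based, inclusive coordinates. A minimal sketch of how the spliced length could be checked against the protein length is given here. The names `LOCATION`, `parse_join`, `exons` and `cds_length` are illustrative and not part of the record, and the parser assumes a plain forward-strand join(...) with no complement() wrapper or partial-boundary markers.

```python
import re

# Location string copied from the record's Location field.
LOCATION = (
    "join(214899..215180,218204..218341,218607..218688,218907..218983,"
    "219196..219302,220395..220574,220652..220748,220854..220945,"
    "221180..221246,221358..221436,221531..221622,221770..221913,"
    "221999..222028)"
)


def parse_join(location: str) -> list[tuple[int, int]]:
    """Return (start, end) pairs from a simple join(...) location string.

    Coordinates are 1-based and inclusive, as in the record.
    Only plain 'start..end' segments are handled (an assumption).
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


exons = parse_join(LOCATION)

# Each segment spans end - start + 1 bases because both ends are inclusive.
cds_length = sum(end - start + 1 for start, end in exons)

print(f"segments: {len(exons)}, spliced CDS length: {cds_length} nt")
```

For this record the segments sum to 1467 nt, that is 489 codons: 488 residues plus the stop codon, which agrees with the Length field of the Protein block.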

Protein

Length 488 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA264084
db_source XM_022044569.1
Definition heparan-alpha-glucosaminide N-acetyltransferase isoform X2 [Carica papaya]

EGGNOG-MAPPER Annotation

COG_category S
Description Protein of unknown function (DUF1624)
KEGG_TC -
KEGG_Module M00078
KEGG_Reaction R07815
KEGG_rclass -
BRITE ko00000, ko00001, ko00002, ko01000
KEGG_ko ko:K10532
EC 2.3.1.78
KEGG_Pathway ko00531, ko01100, ko04142, map00531, map01100, map04142
GOs -

Sequence

CDS:  
ATGCCAATGTACGAGTCTATTAAGGGATATTGCAAGGAGGAAGATGAATGGCGGAGGAAGGGCGCGGGTAGAGGTAGTGATGAGGAGAGTAAGTGTCTGATGAATGACAAGGATCGTAAATGTGCCGCCGATGATTTCGAATCGGCTCTCCAGATCTCTCAATCGCCTCGCCTTCCGATTGCGAAACGTGATTCTCCTCTTTCTCTACAACAACAACAACAACAGCAGCAACAAACGCAACGGCTTGTTTCTCTCGACGTTTTTCGCGGACTCACCGTCGCGCTAATGATAGTTGTGGATGATATTGGAGGAATCCTACCTGCCATCAATCATTCACCATGGAATGGTTTAACCATTGCAGATTTTGTCATGCCATTTTTCCTCTTCATTGTTGGTGTTTCGCTGGCATTTGCATACAAGAAAGTGTCATGCAGATCAACTGCAACGAGAAAAGCTATACTTCGGGCATTGAAGCTCCTACTGTTAGGCCTTTTTCTTCAAGGAGGTTTCTTCCATGGTATCAAGAATTTAACTTATGGAGTCGACATTGAAAAAATGAGATGGTTGGGTGTACTTCAGAGAATTGCAGTAGCATATTTAGTAGCTGCTTTGTGTGAGATTTGGCTGAAGTGGGATGATCGTGTTGGTTCAGATCTATCTTTGATGAGAAAATATCAATATCATTGGGTTGTAGCTTTTGTGCTTACAACTACATATCTATCCTTGTTGTATGGCTTGTATGTTCCTGACTGGGAATACCAGATTCCAGTTGAAGCTCCTTCATCAGCACCACCACAGATATTTTCAGTGAGCCATCCTACTCTCCCGGGCCTTTTAGTTCCTTTCAACTTTTTGAAAATTGCAGATGACACTGGACCAGCTTGCAATGCTGTAGGAATGATTGATCGTAAAGTACTGGGCATTCACCATTTATATGCAAGGCCAATATATGCAAGAACCAAGGAATGCAGTATTAACTCGCCTGATTCTGGCCCTTTACCTCCGGACGCCCCTTCATGGTGTCAAGCACCCTTTGATCCAGAAGGACTTCTGAGTTCAGTGATGGCCATTGTTACCTGCTTGATTGGTTTGCATTATGGGCATACCATTGTCCATTTCAAGGATCACAGGGACAGAATTTTTCAGTGGATGATCCTATCATCCTGTCTCTTAGTCTTCGGCCTTGGCTTGGACATTTTTGGAATGCGTCTAAACAAGGCTCTCTATTCATTCAGTTATATGTGTGTCACTGCTGGTGCTGCTGGCATTCTCTTTGCTGCAATTTATGTTCTGGTTGATGTGTTCGGATATAGGCGCGTAAGTTTCGCATTGGAGTGGATGGGCGTGCATGCATTAATGATTTATATATTTGCAGCATGCAATATCTTACCTCTCATAGTGCATGGATTCTATTGGAGGCAGCCCCAGAATAATATTCTTAGCTTAGTTGGAATCGGAAGGAGATGA
Protein:  
MPMYESIKGYCKEEDEWRRKGAGRGSDEESKCLMNDKDRKCAADDFESALQISQSPRLPIAKRDSPLSLQQQQQQQQQTQRLVSLDVFRGLTVALMIVVDDIGGILPAINHSPWNGLTIADFVMPFFLFIVGVSLAFAYKKVSCRSTATRKAILRALKLLLLGLFLQGGFFHGIKNLTYGVDIEKMRWLGVLQRIAVAYLVAALCEIWLKWDDRVGSDLSLMRKYQYHWVVAFVLTTTYLSLLYGLYVPDWEYQIPVEAPSSAPPQIFSVSHPTLPGLLVPFNFLKIADDTGPACNAVGMIDRKVLGIHHLYARPIYARTKECSINSPDSGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLHYGHTIVHFKDHRDRIFQWMILSSCLLVFGLGLDIFGMRLNKALYSFSYMCVTAGAAGILFAAIYVLVDVFGYRRVSFALEWMGVHALMIYIFAACNILPLIVHGFYWRQPQNNILSLVGIGRR
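
The CDS and Protein entries above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch that assumes the standard genetic code (an assumption for this nuclear plant gene, not stated in the record). `CODON_TABLE` and `translate` are illustrative names, not part of the record or of any library.

```python
from itertools import product

# Standard genetic code, built in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Raises ValueError if the length is not a multiple of three.
    Codons with ambiguous bases are rendered as 'X'.
    """
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons of the CDS in this record.
print(translate("ATGCCAATGTACGAGTCTATTAAGGGATAT"))
```

Passing the full CDS string from the record to `translate` and comparing the result with the Protein string is a quick way to confirm that the two entries belong together.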